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Abstract 

This paper describes a computational, declarative 
approach to prosodic morphology that uses invio- 
lable constraints to denote small finite candidate sets 
which are filtered by a restrictive incremental opti- 
mization mechanism. The new approach is illustrated 
with an implemented fragment of Modern Hebrew 
verbs couched in MicroCUF, an expressive con- 
straint logic formalism. For generation and parsing 
of word forms, I propose a novel off-line technique to 
eliminate run-time optimization. It produces a finite- 
state oracle that efficiently restricts the constraint in- 
terpreter's search space. As a byproduct, unknown 
words can be analyzed without special mechanisms. 
Unlike pure finite-state transducer approaches, this 
hybrid setup allows for more expressivity in con- 
straints to specify e.g. token identity for reduplica- 
tion or arithmetic constraints for phonetics. 

1 Introduction 

Prosodic morphology (PM) circumscribes a number 
of phenomena ranging from 'nonconatenative' root- 
and-pattern morphology over infixation to various 
cases of reduplication, where the phonology strongly 
influences the shape of words by way of obedience 
to structural constraints defining wellformed morae, 
syllables, feet etc. These phenomena have been diffi- 
cult to handle in earlier rule-based treatments (Sproat 
1992, 159 ff). Moreover, as early as Kisseberth 
(1970) authors have noted that derivational accounts 
of PM are bound to miss important linguistic gen- 
eralizations that are best expressed via constraints. 
Kisseberth showed that verb stems in Tonkawa, a 
Coahuiltecan language, display a complex V/0 al- 
ternation pattern when various affixes are added (fig. 
1). This leads to more and more complicated vowel 
deletion rules as the fragment is enlarged. In contrast, 
a straightforward constraint that bans three consecu- 
tive consonants offers a unified account of the condi- 
tions under which vowels must surface. Later devel- 



l to cut' 'to lick' 

picn-O? netl-O? (3sg.obj.stem-3sg.subj.) 

we-pcen-o? we-ntal-o? (3pl.obj.-stem-3sg.subj.) 
picna-n-o? netle-n-o? (3sg.obj.stem-prog.-3sg.subj.) 
p(i)c(e)n(a) n(e)t(a)l(e) stems 

Figure 1 : Tonkawa verb forms with V/0 effects 

opments have refined constraints such as *CCC to re- 
fer to syllable structure instead: complex codas and 
onsets are disallowed. At least since Kahn (1976), 
Selkirk (1982), such segment-independent reference 
to syllable structure has been standardly assumed in 
the generative literature. 

Astonishing as it may be, even the latest computa- 
tional models of PM phenomena apparently eschew 
the incorporation of real prosodic representations, 
syllabification and constraints. Kiraz (1996) uses 
multi-tape two-level morphology to analyze some 
Arabic data, but - despite the suggestive title - must 
simulate prosodic operations such as 'add a mora' 
by their extensionalized rule counterparts, which re- 
fer to C or V segments instead of moras. There is no 
on-line syllabification and the exclusive use of lexi- 
cally prespecified syllable-like symbols on a separate 
templatic pattern tape renders his approach vulnera- 
ble to postlexical resyllabification effects. Similarly, 
Beesley (1996) seems content in employing a great 
number of CV templates in his large-scale finite-state 
model of Arabic morphology, which are intersected 
with lexical roots and then transformed to surface re- 
alizations by various epenthesis, deletion and assim- 
ilation rules. Beesley states that further application 
of his approach to e.g. Hebrew is foreseen. On the 
downside, however, again there is no real prosody in 
his model; the relationship between template form 
and prosody is not captured. 

Optimality Theory (OT, Prince & Smolensky 
1993), as applied to PM (McCarthy & Prince 1993), 
does claim to capture this relationship, using a 



ranked set of violable prosodic constraints together 
with global violation minimization. However, to date 
there exist no sufficiently formalized analyses of 
nontrivial PM fragments that could be turned into 
testable computational models. The OT framework 
itself has been shown to be expressible with weighted 
finite-state automata, weighted intersection and best- 
path algorithms (Ellison 1994) if constraints and 
OT's GEN component - the function from under- 
lying forms to prosodified surface forms - are reg- 
ular sets. A recent proposal by Karttunen (1998) 
dispenses with the weights while still relying on 
the same regularity assumption. Published PM anal- 
yses, however, frequently make use of constraint 
parametrizations from the ALIGN family, which re- 
quires greater than regular power (Ellison 1995). 
Further developments of OT such as correspondence 
theory - extensively used in much newer work on 
PM - have not received a formal analysis so far. Fi- 
nally, although OT postulates that constraints are uni- 
versal, this metaconstraint has been violated from the 
outset, e.g. in presenting Tagalog -um- as a language- 
specific parameter to ALIGN in Prince & Smolen- 
sky (1993). Due to the convincing presentation of 
a number of other forceful arguments against con- 
straint universality in Ellison (to appear), the case for 
language-specific constraints must clearly be seen as 
reopened, and - as a corollary - the case for con- 
straint inviolability as well. 

Declarative Phonology (DP, Bird 1995, Scobbie 
1991) is just such a constraint-based framework that 
dispenses with violability and requires a monostratal 
conception of phonological grammar, as compared 
to the multi-level approaches discussed above. Both 
abstract generalizations and concrete morphemes are 
expressed by constraints. DP requires analyses to 
be formally adequate, i.e. use a grammar descrip- 
tion language with formal syntax and semantics. As 
a consequence, Chomsky's criteria for a generative 
grammar which must be "perfectly explicit" and "not 
rely on the intelligence of the understanding reader" 
(Chomsky 1965, 4) are automatically fulfilled. DP 
thus appears to be a good starting point for a restric- 
tive, surface-true theory of PM that is explicitly com- 
putational. 

The rest of this paper reviews in informal terms the 
theory of Walther (1997) (section 2), showing in for- 
mal detail in section 3 how to implement a concrete 
analysis of Modern Hebrew verbs. Section 4 explains 
a novel approach to both generation and parsing of 
word forms under the new theory. The paper con- 
cludes in section 5. 



2 Declarative Prosodic Morphology 

Focussing on cases of 'nonconcatenative' root-and- 
pattern morphology, Declarative Prosodic Morphol- 
ogy (DPM) starts with an intuition that is opposite to 
what the traditional idea of templates or fixed phono- 
logical shapes (McCarthy 1979) suggests, namely 
that shape variance is actually quite common and 
should form the analytical basis for theoretical ac- 
counts of PM. Besides the Tonkawa case (fig.l), 
shape variance is also at work in Modern Hebrew 
(MH) inflected verb forms (Glinert 1989), see fig. 
2. 1 Here we see a systematic V/0 alternation of both 
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Figure 2: Modern Hebrew \f g.m.r 'finish' (Bl) 

stem vowels, depending on the affixation pattern. 
This results in three stem shapes CVCVC, CVCC and 
CCVC. Any analysis that simply stipulates shape se- 
lection on the basis of specific inflectional categories 
or phonological context (e.g. 3sg.f V 3pl or -V — ► 
CVCC s t em / Bl past) misses the fact that the shapes, 
their alternating behaviour and their proper selec- 
tion are derivable. Derivational repairs by means of 
'doubly open syllable' syncope rules (/ga. ma.r-a./ — ► 
/.gam.ra./) are similarly ad hoc. 

• A first step in developing an alternative DPM 
analysis of MH verbs is to explicitly recognize al- 
ternation of an element X with zero - informally 
written (X) - as a serious formal device besides 
its function as a piece of merely descriptive no- 
tation (cf. Hudson 1986 for an earlier application 
to Arabic). In contrast to nonmonotonic deletion 
or epenthesis, (X) is a surface-true declarative ex- 
pression (Bird 1995, 93f.). The reader is reminded 

'Regular MH verbs are traditionally divided into seven ver- 
bal classes or binyanim, B1-B7. Except for B4 and B6, which 
regularly act as passive counterparts of B3 and B4, the semantic 
contribution of each class is no longer transparent in the modern 
language. Also, in many cases the root (written \f C1.C2.C3) is 
restricted to an idiosyncratic subset of the binyanim. 

An a-templatic treatment of MH prosodic morphology was 
first proposed by Bat-El (1989, 40ff.) within an unformalized, 
non-surface-true, non-constraint-based setting. 



that DP sees grammar expressions as partial for- 
mal descriptions of sets of phonological objects. The 
former reside on a different ontological level from 
the latter, in contrast to traditional object-to-object 
transformations on the same level. Hence a prelim- 
inary grammar expression g{V\)m{V2)r for a He- 
brew stem (with abstract stem vowels) denotes the 
set {gmr, gVimr, gmV^r, gVimX^r}. Note that the 
(X) property as attributed to segmental positions is 
distinctive - in contrast to stem vowels root seg- 
ments do not normally alternate with zero, and nei- 
ther do affix segments in an important asymmetry 
with stems. This point is reinforced by the exceptions 
that do exist: phonologically unpredictable C/0 al- 
ternation occurs in some MH stems, e.g. natan/lakax 
'he gave/took' vsji-ten /ji-kax 'he will give/take'; by 
surface-true (n/l) encoding we can avoid diacritical 
solutions here. 

• Step two uses concatenation to combine indi- 
vidual descriptions of stems and affixes, besides con- 
necting segmental positions within these linguistic 
entities. Since, as we have just seen, a single de- 
scription can denote several objects of varying sur- 
face string length, concatenation Q at the description 
level is actually powerful enough to describe 'non- 
concatenative' morphological phenomena. In DPM 
these do not receive independent ontological status 
(cf. Bird & Klein 1990 and Gafos 1995 for other for- 
mal and articulatory-phonological arguments leading 
to the same conclusion). A more detailed description 
of the 3pl.fut. inflected form of \f g.m.r might there- 
fore be j~i"g"(Viym"(V2)r"'u. In order to allow for 
paradigmatic 2 generalizations over independent enti- 
ties such as root and stem vowel pattern within con- 
catenated descriptions, a hierarchical lexicon con- 
ception based on multiple inheritance of named ab- 
stractions can be used (cf. Riehemann 1993). 

• Step three conjoins a word form description with 
declarative syllabification and syllable structure 
constraints in order to impose prosodic well- 
formedness conditions. For Modern Hebrew (and 
Tonkawa), the syllable canon is basically CV(C). 
Expressed in prosodic terms, complex codas and 
onsets are banned, while an onset must precede each 
syllable nucleus. These syllable roles are established 
in the first place by syllabification constraints that 
exploit local sonority differences between successive 
segments (Walther 1993). Alltogether, the ensemble 

2 See Walther (1997) for a discussion of various ways to de- 
rive rather than stipulate the syntagmatic pattern of alternating 
and non-alternating segmental positions within stems. 



of prosodic constraints indeed succeeds in narrow- 
ing down the set for the 3sg.m past tense form 
to {*.gmr.,*.gamr.,*.gmar.,\.ga.mar.} = 
/gamar/. For 3pl. future tense Bl, how- 
ever, an unresolved ambiguity remains: in 
{.jig.me.ru., .ji.gam.ru.}, only the first ele- 
ment is grammatical. 3 An important observation is 
that in general there can be no purely phonological 
constraint to disambiguate this type of situation. 
The reason lies in the existence of minimal pairs 
with different category. In our case, homophonous 
/.ji.gam.ru./ is grammatical as 3pl. fut. B2 'they 
will be finished'. We will return to the analysis of 
such cases after proposing a specific disambiguation 
mechanism in the next step. 

• Step four eliminates the remaining ambiguity 
by invoking an Incremental Optimization Princi- 
ple (IOP): "For all (X) elements, prefer the zero al- 
ternant as early as possible". "Early" corresponds to 
traditional left-to-right directionality, but is meant to 
be understood w.r.t. the speech production time ar- 
row. "As possible" means that IOP application to 
a (X) position nevertheless realizes X if its omis- 
sion would lead to a constraint conflict. Hence, 
the IOP correctly rules out the second element 
of {.jig.me.ru.,*. ji.gam.ru.}. This is because 
.ji.gam.ru. represents a missed chance to leave out 
/a/, the earlier one of the two stem vowels. The reader 
may verify that the IOP as it stands also accounts 
for the Tonkawa data of fig. 1. Tonkawa lends even 
clearer support to IOP's left-to-right nature due to the 
larger number of V/0 vowels involved. As a limit- 
ing case, the IOP predicts the possibility of vowel- 
less surface stems, e.g. formed by two root conso- 
nants combined with vowel-final prefix and suffix. 
This prediction is strikingly confirmed by MH forms 
like te-lx-i 'you (sg.f.) will go' ^(h).l.\, ti-kn-u 
'you/they (pi.) will buy' \f k.n.0, ti-tn-i 'you (sg.f.) 
will give' \f (n).i.n; similar cases exist in Tigrinya. 
There can be no meaningful prosodic characteriza- 
tion of isolated CC stem shapes; only a wordform- 
based theory like the present one may explain why 
these forms exist. 

Note that, conceptually, IOP is piggybacked on au- 
tonomous DP-style constraint interaction. It merely 
filters the small finite set of objects described by 
the conjunction of all constraints. From another an- 
gle, IOP can be seen as a single context-free sub- 

3 Note that the prosodic view explains the pronounced influ- 
ence of (C)V affixes on the shape of the whole word: they pro- 
vide a nonalternating syllable nucleus which can host adjacent 
stem consonants. 



stitute for the various syncope rules employed in 
former transformational analyses. The claim is that 
fixed-directionality-IOP is the only such mechanism 
needed to account for PM phenomena. 

A distinguishing feature of the IOP is its poten- 
tial for an economical procedural implementation 
in incremental production. If constraint contexts are 
sufficiently local, the principle can locally decide 
over (X) nonrealizations and there will be very lim- 
ited backtracking through delayed detection of con- 
straint violation. Because the IOP stops after find- 
ing the first (X) realization pattern that violates no 
constraints, it has less formal power than global opti- 
mization which must always consider all candidates. 
Moreover, the IOP supports economic communica- 
tion, as it leads to shortest surface forms wherever 
possible. Finally, at least for root-and-pattern mor- 
phologies it can be argued to aid in speech per- 
ception as well. This is because the closed class of 
stem vowel patterns is less informative than open- 
class root segments. Since IOP-guided vowel omis- 
sion causes root segments to (statistically) appear 
at an earlier point in time from the acoustic onset 
of the word, the IOP hypothesis actively prunes the 
size of the cohort of competing lexical candidates. 
As a result, unambigous recognition will generally 
be achieved more quickly during continous lexical 
access. In sum, the IOP hypothesis not only pos- 
sesses overall psycholinguistic plausibility but actu- 
ally gives some processing advantage to shape vari- 
ance. If future research provides the necessary ex- 
perimental confirmation, we have yet another case of 
performance shaping competence. 

• Step five returns to the minimal pairs problem 
highlighted in step three: what to do with anti-IOP 
realizations such as that of /a/ in /.ji.gam.ru./ for B2 
fut.? The answer is (prosodic) prespecification. A 
surface-true constraint demands that B2 future and 
infinitive as well as all of B3, B4 must have an onset 
role for the first stem element. Thus, the possibility 
of IOP eliminating the first stem vowel is blocked by 
the constraint inconsistency that arises for the first 
stem element: either syllabification licenses an in- 
compatible coda or first and second stem segment 
together form an informed onset cluster. Note that if 
the constraint is lexicalized as part of the grammat- 
ical description of first stem position, it will have a 
maximally local context, referring to just the position 
itself. In general, DPM analyses pay much attention 
to proper attachment sites of constraints in order to 
maximize their locality. 

The MH verbal suffix -et (fem.sg.pres.) illustrates 



that sometimes another, segmental mode of prespec- 
ification is useful. This suffix is always preceded by 
a syllable ending in Id, although IOP application 
alone would e.g. prefer */gom.ret/ over /go.me.ret/ 
'she finishes'. The effect is morpheme-specific since 
other -VC suffixes behave as expected here: gomr- 
im/ot 'they (masc./fem.) finish'. One solution is to let 
part of the suffix definition be a constraint statement 
which demands that the segment two positions to its 
left must be a front vowel. This move captures both 
the stability and the quality of this vowel at the same 
time. (Apophony constraints ensure that the second 
stem vowel is never /i/ except in B5, which signifi- 
cantly has a different suffix -a in place of -et). Note 
that prespecifying the presuffixal segment to be in an 
onset position would not work. 

3 On implementing analyses 

In the following I show how to implement a toy frag- 
ment of MH verbs using the MicroCUF formalism, 
a typed, feature-based constraint-logic programming 
language suitable for natural language modelling. 
MicroCUF implements a subset of CUF (Dorre & 
Dorna 1993), inheriting its formal semantics. It was 
initially delevoped by the author to overcome effi- 
ciency problems with CUF's original type system. 
Additionally, its simpler implemenation provides an 
open platform for experimental modifications, as 
needed e.g. for parsing and generation with DPM. 
After briefly introducing the essentials of MicroCUF 
first, the MH analysis is developed and explained. 

3.1 The MicroCUF constraint formalism 

This section assumes a basic knowledge of Prolog. 
Like in Prolog, MicroCUF variables start with upper- 
case letters or _ , whereas relational symbols, fea- 
tures and simplex types start in lowercase; % marks 
a comment (fig. 3a). Relations like member are writ- 
ten in functional notation, with a notationally distin- 
guished result argument on the righthand side of : = 
and the relation symbol plus its (optional) arguments 
on the lefthand side. Subgoals like member (Elem) 
can occur anywhere as subterms. Instead of Prolog's 
fixed-arity first order terms, MicroCUF has typed 
feature terms as its basic data structures. As illus- 
trated in fig. 3b, subterms are explicitly conjoined 
with & or disjunctively combined with ; , while only 
type terms may be prefixed by the negation oper- 
ator ~. Features like left, cat are separated 
from their righthand value terms by : . Terms may be 
tagged by conjunction with a variable (vi), allowing 
for the expression of structure sharing through mul- 



tiple occurences of the same variable. Feature appro- 
priateness declarations ( : : ) ensure that both the term 
in which a feature occurs and its value are typed. For 
comparison, the result value of f s appears in HPSG- 
style notation under fig. 3c. 



a. 



% MicroCUF 
member (Elem) 
member (Elem) 



[Elem|_] . 

[_ I member (Elem) 



%Prolog 

member (Elem, [Elem|_] 
member (Elem, [_|Rest] 



member (Elem, Rest ) 



fs :=cat : (~ ( (b2;b3;b5) Spast) &V1 ) & left : cat : VI 
phonlist: : [ cat : categories ] . 



segmental positions to the left - a frequent situation 
in phonological contexts - we supplement it with a 
new feature left to yield bidirectional lists. For this 
doubly-linked list encoding to be wellbehaved, a step 
right followed by a step left is constrained to return to 
the same position Self (3), thus yielding cyclic fea- 
ture structures. Next, the value of the feature cat at 
the current position is connected with its right neigh- 
bour (3-4). In the face of our recursively structured 
lists this makes morphological and other global cate- 
gorial information locally accessible at each segmen- 
tal position. Finally, relations to incrementally clas- 
sify each segmental position as word-initial, medial 
or wordfinal and to impose prosodic constraints are 
added in (5-6). 

Basic concatenation is used in (10-12) to define 
X/0 positions. 



fs = 



phonlist 

cat CD -> ( (b2 VMV b5) A past ) 
phonlist 
cat CD 



left 



Segments . 
mark: marked & 



Figure 3: MicroCUF vs Prolog and HPSG notation 

3.2 Modern Hebrew verbs in MicroCUF 

Below I present a concrete MicroCUF grammar 
in successive pieces. It encodes a toy fragment 
of MH verbs and represents a simplified excerpt 
from a much larger computational grammar. For 
lack of space, the type hierarchy - specifying syl- 
lable roles, segments, morphological categories and 
word-peripheral position - and the definition of 
syllabify (formalized in Walther 1995) have been 
omitted. 

Let us start the explanation with a basic 
concatenation relation which adds a position Self 
in front of some string of Segments (1-6). 



1 cone (Self, Segments) := 

2 Self & 

3 right : (Segments&left : Self Scat : Cat ) & 
cat : Cat & 

classif y_position_in_word & 
constraints . 



8 classif y_position_in_word := 

9 right : self :' -ini' & lef t : self :' -fin' . 

Here, the familiar recursive first-rest encoding of 
lists translates into self -right features. This alone 
makes self and (arbitrarily long) right-context ref- 
erences possible. To support looking one or more 



mark : unmarked & 



10 x_0 (_, Segments) 

11 x_0 (X, Segments) 

12 cone (X, Segments 

13 

14 obi (X, Segments) 

15 conc(X, Segments). 

16 

17 is (Segment) := self : seg : Segment . 

The first clause of x_0 (10) realizes the zero alternant 
by equating in its second argument the Segments to 
follow with the result argument; the first argument 
holding x is unused. It gets used in the second clause 
(11-12), however, where it is prefixed to the follow- 
ing Segments by ordinary concatenation. The value 
of an additional feature mark specifies that realiz- 
ing an X position is marked w.r.t. the IOP, whereas 
no such value is prescribed in the first clause. In- 
stead, the marking there will be supplied later by ad- 
jacent instances of either the second x_0 clause or 
obi (14-15). The latter is the version of concatena- 
tion used for specifying obligatory, i.e. nonalternat- 
ing positions, which consequently are specified as 
unmarked. Alltogether these means yield fully spec- 
ified strings w.r.t. markedness information. We will 
see below how this simplifies an implementation of 
the IOP. 

As can be seen in the accessor relation is (17), 
phonological segments are actually embedded un- 
der a further feature seg. This treatment enables 
structure-sharing of segments independent of their 
syllable roles. 

The syllable shape constraint (18-25) shows first 
of all that syllable roles are modelled as types under 
self. 



18 shape := 



19 ( self: (nucleus & seg:vowel) & 

20 left : self : onset 

21 ; self: ("nucleus) & 

22 ( self:onset & lef t : self :( "onset ) 

23 ; self:coda & lef t : self :( ~ coda) 

24 ) 

25 ) . 
26 



27 constraints := syllabify & shape. 

Lines (19-20) capture the fact that syllable nuclei 
in MH are always vowels and that every syllable 
nucleus is preceded by an onset. In (21-22) a non- 
nuclear position that is an onset may only license 
preceding non-onsets, thus disallowing complex on- 
sets; similarly for codas in (23). In (27) generic 
syllabify is intersected with shape, since seg- 
mental positions must be prosodified and conform 
to language-specific shape restrictions. 

The constraints under (28-30), included for com- 
pleteness, merely ensure proper termination of seg- 
mental strings at the word periphery. 



28 word := self: ('+ini' & prom:up & onset) . 

29 end := left : self :(' +f in' & "onset) & 

30 self : ' -fin' . 

Prosodic prespecification (31-36) faithfully models 
what was stated in prose in section 2. 



31 prosodic_prespecif ication := 

32 ( cat: ( (b2& (~ (past;pres) ) ) ;b3;b4) & 

33 self:onset 

34 ; 

35 cat:(~((b2& ( ~ (past ; pres ) ) ) ; b3 ; b4 ) ) 

36 ) . 



We proceed in (37-41) with a rudimentary definition 
of first (vl) and second (v2) stem vowel which is 
sufficient for our toy fragment. 



37 


vl 


:= is(low) & cat: (past & bl;b7). 


38 


vl 


:= is (round & '-hi') & 


39 




cat : (bl & " past ) . 


40 


v2 


: = is (low) . 


41 




: = is ( front & ' -hi ' ) . 



The larger grammar mentioned above contains a full 
binary decision tree for each vowel. Still, even here 
one can see the use of type formulae like round & 
' -hi' to classify segments phonologically. 

Next come a number of exemplary inflectional af- 
fixes (42-79), again simplified. The zero affixes (42- 
45, 47-54) are phonologically just like the zero alter- 
nant in (10) in taking up no segmental space. 



42 % initial "0" prefix 

43 '#' (More) := More & self:'+ini' & 

44 cat: (~ fut & ~ infinitive & 

45 ( bl ; (~ pres & (b3 ; b4) ) ) ) . 
46 

47 % final "0" suffix 

48 ' #' (More) : = More & 



49 self:'-ini' & left : self :' +f in' & 

50 ( cat: (sg & masc & third & past) & 

51 left cleft: is ("front) 

52 ; cat: (sg & masc & third & pres) & 

53 left : left : is (front) 

54 ) . 
55 



56 % overt prefix 

57 ji(More) := selfc'+ini' & 

58 obi (is (i) , obi (is (i) , More)) & 

59 cat: (fut & third & ( (sg&masc) ; pi) & 

60 (bl ; b2) ) . 

The segmental content of all other affixes is spec- 
ified via possibly repeated instances of obi, since 
affixes are nonalternating. Apart from the respective 
categorial information, positional type information 
'+ini','+fin' ensures that prefixes and suffixes 
are properly restricted to wordinitial and wordfinal 
position. Note that the glide-initial ji- prefix specifies 
an initial HI (58) which will be prosodified as onset 
by means of syllabify. This representational as- 
sumption is in line with other recent work in phono- 
logical theory which standardly analyzes glides as 
nonsyllabic high vowels. Hence, even in MH we 
have a case where segmental classes and prosodic 
roles don't align perfectly. 

To control second stem vowel apophony, some 
suffixes demand (53,73) or forbid (51) front vowels 
two positions to their left. 



61 u(More) := obi (is (u) Sself : ' +f in' , More) & 

62 leftcleftcis (~ (vowel & "front)) & 

63 cat : (pi & ( (past & third) 

64 ; (fut & ~ first) ) ) . 
65 

66 a (More) :=obl (is(a)&selfc'+fin',More)& 

67 leftcleftcis (" (vowel & "front)) & 

68 cat: ((past & third & sg & fem) 

69 ; (pres & sg & fem & b5) ) . 



70 

71 et (More) : = 

7 2 obi (is (e) , obi (is (t ) Sself : ' +f in' ,More) ) & 

73 left : lef t : is (front) & 

74 cat: (pres & sg & fem & ~b5) . 
75 

76 im(More) := 

77 obi (is (i) , obi (is (m) Sself : ' +f in' ,More) ) & 

78 leftcleftcis (" (vowel & "front)) & 

79 cat: (pres S pi S masc) . 



Others posit the weaker demand vowel — > front 
(62,67,78), thus not forbidding consonantal fillings 
of the position adressed by left:left. 

The stem definition (80-82) for a regular triliteral 
is parametrized for the three root segments and the 
inflectional Suffixes to follow. 



80 stem(Cl, C2, C3, Suffixes) := 

81 obi (is (CI) , x_0 (vl, obi (is (C2) , 

82 x_0 (v2, obi (is (C3) , Suffixes))))). 

83 

84 affixes (Stem, 'f (end) ) := '#'(Stem). 

85 affixes (Stem, a (end) ) := '#'(Stem). 

86 affixes (Stem, et (end) ) := 't'(Stem). 

87 affixes (Stem, im(end)) := 't'(Stem). 

88 affixes (Stem, u (end) ) := yi (Stem) . 
89 

90 verbform([Cl & consonant, C2 & consonant, 

91 C3 & consonant], Category) := 

92 root_letter_tree ( [CI, C2, C3] ) & word & 

93 affixes ( prosodic_prespecif ication & 

94 stem (CI, C2, C3, Suffixes), 

95 Suffixes) & cat : Category . 

Given the informal description in section 2, the suc- 
cession of obligatory root and alternating stem vowel 
positions now looks familiar. It should be obvi- 
ous how to devise analogous stem definitions for 
quadriliterals (e.g. mijjev) and cluster verbs (e.g. 
flirt et). 

A rather simple tabulation of affixes lists 
(a subset of) the allowable prefix-suffix cooccur- 
rences in the MH verbal paradigm (84-88) be- 
fore everything is put together in the defini- 
tion for verb form, parametrized for a list of 
root segments and Category (90-95). Note how 
prosodic.prespecif ication is intersected with 
stem in (93-94), exploiting the power of the descrip- 
tion level to restrict stem realizations without diacrit- 
ical marking of stem vs affix domains on the object 
level. The subgoal root_letter_tree (92) will be 
discussed below. 

When proving a goal like verb form ( [g,m, r] , 
bl&third&pl&fut) , the MicroCUF interpreter 
will enumerate the set of all candidate result feature 
structures, including one that describes the gram- 
matical surface string jigmeru. An implementation 
of the IOP, to be described next, must therefore 
complement the setup established sofar to exclude 
the suboptimal candidates. While the subtle inter- 
twining of zero alternant preference and constraint 
solving described above has its theoretical merits, a 
much simpler practical solution was devised. In a 
first step, the small finite set of all candidate solu- 
tions for a goal is collected, together with numerical 



'disharmony' values representing each candidate's 
degree of optimality. Disharmony is defined as the 
binary number that results from application of the 
mapping {unmarked ^ OI2, marked ^ IO2} to 
the left-to-right markedness vector of a segmental 
string: e.g., joiioigoiaiom ir iuoi yields the dishar- 
mony value 010101 IOOIOIOI2 = 5525 i0 > 5477i = 
OIOIOIOIIOOIOI2 from joiioigoimoieioroiuoi- Step 
two is a straightforward search for the candidate(s) 
with minimal disharmony. 

4 Parsing and generation 

The preceding paragraph described how to com- 
pute surface forms given roots and categories. How- 
ever, this generation procedure amounts to an ineffi- 
cient generate-and-minimize mechanism which must 
compute otherwise useless suboptimal candidates as 
a byproduct of optimization. More importantly, due 
to the nonmonotonicity of optimization it is not obvi- 
ous how to invert the procedure for efficient parsing 
in order to derive root and category given a surface 
form. 

A first solution which comes to mind is to im- 
plement parsing as analysis-by-synthesis. A goal 
like ParseString&verbf orm (Root, Category) 
is submitted to a first run of the MicroCUF con- 
straint solver, resulting in instantiations for Root 
and Category iff a proof consistent with the 
grammar was found. With these instantiations, a 
second run of MicroCUF uses the full generate- 
and-minimize mechanism to compute optimal 
strings OptStringl, . . . ,OptStringN. The parse 
is accepted iff ParseString& (OptStringl ; 
. . . ; OptStringN) is consistent. Note that for this 
solution to be feasible it is essential that constraints 
are inviolable, hence their evaluation in the first run 
can disregard optimization. The main drawbacks of 
analysis-by-synthesis are that two runs are required 
and that the inefficiencies of generate-and-minimize 
are not avoided. 

The new solution recognizes the fact that bidi- 
rectional processing of DPM would be easy with- 
out optimization. We therefore seek to perform all 
optimization at compile time. The idea is this: ex- 
ploiting the finiteness of natural language paradigms 
we compute - using generate-and-minimize — each 
paradigm cell of e.g. the verbal paradigm of MH for 
a suitable root. However, while doing so we record 
the proof sequence of relational clause invocations 
employed in the derivation of each optimal form, us- 
ing the fact that each clause has a unique index in 
internal representation. Such proof sequences have 



two noteworthy properties. By definition they first 
of all record just clause applications, therefore nat- 
urally abstracting over all non-relational parameter 
fillings of top-level goals. In particular, proving a 
goal like verbform ( [g, m, r] , bl;b2) normally 
looses the information associated with the root and 
category parameters in the proof sequence represen- 
tation (although these parameters could indirectly in- 
fluence the proof if relationally encoded choices in 
the grammar were dependent on it). Secondly, we 
can profitably view each proof sequence as a linear 
finite state automaton (FSA ce u). Since a paradigm 
is the union of all its cells, a complete abstract 
paradigm can therefore be represented by a unique 
minimal deterministic FSA pam which is computed 
as the union of all FSA ce ^ followed by determiniza- 
tion and minimization. At runtime we just need to 
run FSA para as a. finite-state oracle in parallel with 
the MicroCUF constraint solver. This means that 
each proof step that uses a clause k must be sanc- 
tioned by a corresponding fc-labelled FSA transition. 
With this technique we are now able to efficiently re- 
strict the search space to just the optimal proofs; the 
need for run-time optimization in DPM processing 
has been removed. However, a slight caveat is nec- 
essary: to apply the technique it must be possible to 
partition the data set into a finite number of equiv- 
alence classes. This condition is e.g. automatically 
fulfilled for all phenomena which exhibit a paradigm 
structure. 

What are the possible advantages of this hybrid 
FSA-guided constraint processing technique? First 
of all, it enables a particularly simple treatment of 
unknown words for root-and-pattern morphologies, 
surely a necessity in the face of ever-incomplete 
lexicons. If the grammar is set up properly to ab- 
stract from segmental detail of the Root segments 
as much as possible, then these details are also ab- 
sent in the proof sequences. Hence a single FSA para 
merging these sequences in effect represents an ab- 
stract paradigm which can be used for a large number 
of concrete instantiations. We thus have a principled 
way of parsing words that contain roots not listed 
in the lexicon. However, we want the system not to 
overgenerate, mistakenly analyzing known roots as 
unknown. Rather, the system should return the se- 
mantics of known roots and also respect their verbal 
class affiliations as well as other idiosyncratic prop- 
erties. This is the purpose of the root_letter_tree 
clauses in (96-123). 



96 root_letter_tree ( [g | Rest ] ) := 

97 root_letter_tree_g (Rest) . 

98 root_letter_tree ( [~g|_] ) := 

99 cat : sem: ' UNKNOWN' . 
100 

101 root_letter_tree_g ( [m | Rest ] ) := 

102 root_letter_tree_gm (Rest ) . 

103 root_letter_tree_g ( [d | Rest ] ) := 

104 root_letter_tree_gd (Rest ) . 

105 root_letter_tree_g ( [ ~m& ~d | _] ) := 

106 cat : sem: ' UNKNOWN' . 
107 

108 root_letter_tree_gm ( [r ] ) := 

109 cat:(bl & sem:'FINISH' 

110 ; b2 & sem: 'BE FINISHED'). 

111 root_letter_tree_gm ( [ ~r | _] ) := 

112 cat : sem: ' UNKNOWN' . 

113 root_letter_t ree_gd ( [ r ] ) := 

114 cat:( bl & sem: ' ENCLOSE' 



115 ; b2 & sem: 'BE ENCLOSED' 

116 ; b3 & sem: 'FENCE IN' 

117 ; b4 & sem: 'BE FENCED IN' 

118 ; b5 & sem: 'DEFINE' 

119 ; b6 & sem: 'BE DEFINED' 

120 ; b7 & sem: 'EXCEL' 

121 ) . 



122 root_letter_t ree_gd ( [ ~ r | _] ) := 

123 cat : sem: ' UNKNOWN' . 



For each level in the letter tree a new terminal 
branch is added that covers the complement of all at- 
tested root segments at that level (99,106,112,123). 
This terminal branch is assigned an ' unknown' se- 
mantics, whereas known terminal branches record 
a proper semantics and categorial restrictions. Dur- 
ing off-line creation of the proof sequences we now 
simply let the system backtrack over all choices in 
the root_letter_tree by feeding it a totally un- 
derspecified Root parameter. The resulting FSA para 
represents both the derivations of all known roots 
and of all possible unknown root types covered by 
the grammar. While this treatment results in a ho- 
mogenous grammar integrating lexical and gram- 
matical aspects, it considerably enlarges FSA para . It 
might therefore be worthwhile to separate lexical ac- 
cess from the grammar, running a separate proof of 
root_letter_tree (Root) to enforce root-specific 
restrictions after parsing with the abstract paradigm 
alone. It remains to be seen which approach is more 
promising w.r.t. overall space and time efficiency. 

A second advantage of separating FSA guidance 
from constraint processing, as compared to pure 
finite-state transducer approaches, is that we are free 
to build sufficient expressivity into the constraint lan- 
guage. For example it seems that one needs token 
identity, i.e. structure sharing, in phonology to cover 



instances of antigemination, assimilation, dissimila- 
tion and reduplication in an insightful way. It is well- 
known that token identity is not finite-state repre- 
sentable and cumbersome to emulate in practice (cf. 
Antworth 1990, 157 on a FST attempt at reduplica- 
tion vs the DPM treatment of infixal reduplication 
in Tigrinya verbs described in Walther 1997, 238- 
247). Also, it would be fascinating to extend the 
constraint-based approach to phonetics. However, a 
pilot study reported in Walther & Kroger (1994) has 
found it necessary to use arithmetic constraints to 
do so, again transcending finite-state power. Finally, 
to the extent that sign-based approaches to grammar 
like HPSG are on the right track, the smooth integra- 
tion of phonology and morphology arguably is bet- 
ter achieved within a uniform formal basis such as 
MicroCUF which is expressive enough to cover the 
recursive aspects of syntax and semantics as well. 

In conclusion, some notes on the pilot implemen- 
tation. The MicroCUF system was modified to pro- 
duce two new incarnations of the MicroCUF inter- 
preter, one to record proof sequences, the other to 
perform FSA-guided proofs. FSA pam was created 
with the help of finite-state tools from AT&T's freely 
available fsm package (http: //www. research, 
att. com /sw /tools / fsm/). I have measured 
speedups of more than 10 2 for the generation of MH 
forms (< 1 second with the new technique), although 
parse times in the range of 1 ... 4 seconds on a Pen- 
tium 200 MHz PC with 64 MByte indicate that the 
current prototype is still too slow by a factor of more 
than 10 2 . However, there is ample room for future 
improvements. Besides drawing from the wealth of 
optimizations found in the logic programming liter- 
ature to generally accelerate MicroCUF (e.g., term 
encoding of feature structures, memoization) we can 
also analyze the internal structure of FS A para to gain 
some specific advantages. This is due to the fact that 
each maximal linear sub-FSA of length k > 1 corre- 
sponds to a deterministic proof subsequence whose 
clauses should be partially executable at compile 
time, subsequently saving k — 1 proof steps at run- 
time. 

5 Conclusion 

This paper has described a computational, declara- 
tive approach to prosodic morphology which uses 
inviolable constraints formulated in a sufficiently ex- 
pressive formalism (here: MicroCUF) together with 
a restrictive incremental optimization component. 
The approach has been illustrated by implementing 
an a-templatic analysis of a fragment of Modern He- 



brew verbs. The full grammar behind the illustrative 
fragment covers additional detail such as antigem- 
ination effects (noded-im, *nodd-im 'they (masc.) 
wander'), spirantization, B7 sibilant metathesis, etc. 
Also, the formalization of X/0 presented here is 
actually a special case of the more powerful no- 
tion of resequencing, whose application to Tigrinya 
vowel coalescence and metathesis was demonstrated 
in Walther (1997). 

Despite the initial emphasis on incremental op- 
timization, a compilation technique was later pro- 
posed to remove the need for run-time optimiza- 
tion and guarantee fully bidirectional processing of 
prosodic morphology. Although the general idea of 
using a finite-state oracle to guide a parser has 
been previously proposed for context-free grammars 
(Johnson 1996), both the details of our implemen- 
tation of the idea and its specific application to 
prosodic morphology are believed to be novel. It was 
emphasized how the proposed technique aided in a 
simple treatment of unknown words. Note that un- 
known words are not normally integrated into finite- 
state transducer models of prosodic morphology, al- 
though the necessary extensions appear to be pos- 
sible (K. Beesley, p.c). Finally, the fact that a hy- 
brid setup rather than a pure finite-state approach 
was chosen has been motivated inter alia by refer- 
ence to additional phenomena such as antigemina- 
tion and reduplication that require the richer notion 
of token identity. Future research will especially fo- 
cus on detailed analyses of reduplication phenomena 
to secure the relevance of the present approach to 
prosodic morphology at large. 
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